AITopics

2605.14168

Country: North America > United States (0.68)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.54)

Kevin G. Jamieson, Daniel Haas, Benjamin Recht

The Power of Adaptivity in Identifying Statistical Alternatives

Neural Information Processing SystemsMay-1-2026, 05:47:07 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, artificial intelligence, machine learning, (16 more...)

Country: North America > United States > California (0.68)

Industry: Information Technology (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Neural Information Processing SystemsApr-29-2026, 11:24:53 GMT

Appendix1 ASpectral Analysis and LTI-SDE2 We consider the Matérn kernel family, 3 κν(t,t0) = a

Consider a general probabilistic43 model with latent parameters θ.

artificial intelligence, machine learning, modeling & simulation, (18 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Modeling & Simulation (0.69)

Neural Information Processing SystemsApr-25-2026, 11:52:12 GMT

Appendix AToy example

In this section, we provide and expand upon a toy example. Recall that the inputs x and x0 need not correspond to real users but could instead represent hypothetical users. Example 5. Suppose that the regulatory guideline requires that users in the same geographical location receive similar weather forecasts. This can be written as "the weather forecasts that are selected by F should be similar for all users in the same geographical location", and S could be a randomly generated set of user pairs, where each pair corresponds to two (hypothetical) users in the same geographical location, and S could contain pairs across many locations. In the left-most panel, a filtering algorithm F takes in counterfactual inputs x and x0 and produces the content Z and Z0. Because a counterfactual regulation requires that F behave similarly under x and x0, the regulation is effectively requiring that content Z and Z0 are sufficiently similar (or, graphically, that they are close in Z). The question of how to quantify "similarity" is addressed in Section 2.1. The toy example in Example 5 is illustrated in the right-most panel.

artificial intelligence, machine learning, regulation, (19 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Neural Information Processing SystemsApr-25-2026, 07:24:05 GMT

2d290e496d16c9dcaa9b4ded5cac10cc-Paper.pdf

artificial intelligence, fim, machine learning, (19 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Neural Information Processing SystemsApr-22-2026, 15:20:55 GMT

Natural-Parameter Networks: A Class of Probabilistic Neural Networks

Hao Wang, Xingjian SHI, Dit-Yan Yeung

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, npn, (18 more...)

Country:

North America > United States (0.46)
North America > Canada > Ontario > Toronto (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.93)

Neural Information Processing SystemsApr-22-2026, 12:13:50 GMT

Learning Additive Exponential Family Graphical Models via $\ell_{2,1}$-norm Regularized M-Estimation

Xiaotong Yuan, Ping Li, Tong Zhang, Qingshan Liu, Guangcan Liu

We investigate a subclass of exponential family graphical models of which the sufficient statistics are defined by arbitrary additive forms. We propose two ℓ2,1norm regularized maximum likelihood estimators to learn the model parameters from i.i.d.

artificial intelligence, estimator, machine learning, (14 more...)

Country: Asia (0.28)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.49)

arXiv.org Machine LearningApr-15-2026

Information-Geometric Decomposition of Generalization Error in Unsupervised Learning

Kim, Gilhan

We decompose the Kullback--Leibler generalization error (GE) -- the expected KL divergence from the data distribution to the trained model -- of unsupervised learning into three non-negative components: model error, data bias, and variance. The decomposition is exact for any e-flat model class and follows from two identities of information geometry: the generalized Pythagorean theorem and a dual e-mixture variance identity. As an analytically tractable demonstration, we apply the framework to $ε$-PCA, a regularized principal component analysis in which the empirical covariance is truncated at rank $N_K$ and discarded directions are pinned at a fixed noise floor $ε$. Although rank-constrained $ε$-PCA is not itself e-flat, it admits a technical reformulation with the same total GE on isotropic Gaussian data, under which each component of the decomposition takes closed form. The optimal rank emerges as the cutoff $λ_{\mathrm{cut}}^{*} = ε$ -- the model retains exactly those empirical eigenvalues exceeding the noise floor -- with the cutoff reflecting a marginal-rate balance between model-error gain and data-bias cost. A boundary comparison further yields a three-regime phase diagram -- retain-all, interior, and collapse -- separated by the lower Marchenko--Pastur edge and an analytically computable collapse threshold $ε_{*}(α)$, where $α$ is the dimension-to-sample-size ratio. All claims are verified numerically.

artificial intelligence, decomposition, machine learning, (19 more...)

2604.1234

Country:

Asia > South Korea > Seoul > Seoul (0.04)
North America > United States > New Jersey > Hudson County > Hoboken (0.04)
Europe > Russia (0.04)
(2 more...)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.70)

Guilmeau, Thomas, Hendrikx, Hadrien, Forbes, Florence

Convergence of projected stochastic natural gradient variational inference for various step size and sample or batch size schedules

arXiv.org Machine LearningApr-2-2026

Stochastic natural gradient variational inference (NGVI) is a popular and efficient algorithm for Bayesian inference. Despite empirical success, the convergence of this method is still not fully understood. In this work, we define and study a projected stochastic NGVI when variational distributions form an exponential family. Stochasticity arises when either gradients are intractable expectations or large sums. We prove new non-asymptotic convergence results for combinations of constant or decreasing step sizes and constant or increasing sample/batch sizes. When all hyperparameters are fixed, NGVI is shown to converge geometrically to a neighborhood of the optimum, while we establish convergence to the optimum with rates of the form $\mathcal{O}\left(\frac{1}{T^ρ} \right)$, possibly with $ρ\geq 1$, for all other combinations of step size and sample/batch size schedules. These rates apply when the target posterior distribution is close in some sense to the considered exponential family. Our theoretical results extend existing NGVI and stochastic optimization results and provide more flexibility to adjust, in a principled way, step sizes and sample/batch sizes in order to meet speed, resources, or accuracy constraints.

artificial intelligence, intdoma, machine learning, (16 more...)

2604.00683

Country:

Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Africa > Middle East > Morocco > Tanger-Tetouan-Al Hoceima Region > Tangier (0.04)

Genre: Research Report (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.90)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.34)

arXiv.org Machine LearningMar-25-2026

Exponential Family Discriminant Analysis: Generalizing LDA-Style Generative Classification to Non-Gaussian Models

Lakkapragada, Anish

We introduce Exponential Family Discriminant Analysis (EFDA), a unified generative framework that extends classical Linear Discriminant Analysis (LDA) beyond the Gaussian setting to any member of the exponential family. Under the assumption that each class-conditional density belongs to a common exponential family, EFDA derives closed-form maximum-likelihood estimators for all natural parameters and yields a decision rule that is linear in the sufficient statistic, recovering LDA as a special case and capturing nonlinear decision boundaries in the original feature space. We prove that EFDA is asymptotically calibrated and statistically efficient under correct specification, and we generalise it to $K \geq 2$ classes and multivariate data. Through extensive simulation across five exponential-family distributions (Weibull, Gamma, Exponential, Poisson, Negative Binomial), EFDA matches the classification accuracy of LDA, QDA, and logistic regression while reducing Expected Calibration Error (ECE) by $2$-$6\times$, a gap that is structural: it persists for all $n$ and across all class-imbalance levels, because misspecified models remain asymptotically miscalibrated. We further prove and empirically confirm that EFDA's log-odds estimator approaches the Cramér-Rao bound under correct specification, and is the only estimator in our comparison whose mean squared error converges to zero. Complete derivations are provided for nine distributions. Finally, we formally verify all four theoretical propositions in Lean 4, using Aristotle (Harmonic) and OpenGauss (Math, Inc.) as proof generators, with all outputs independently machine-checked by AXLE (Axiom).

artificial intelligence, efda, machine learning, (17 more...)

2603.20655

Country: Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.91)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)